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Clinical Development Programs for Drugs, Devices, and Biological Products forth' 
Treatment of Rheumatoid Arthritis (RA) 



This guidance document represents the Agency's current thinking on Clinical 
Development Programs for Drugs, Devices, and Biological Products Intended for the 
Treatment of Rheumatoid Arthritis. It does not create or confer any rights for or on any 
person and does not operate to bind FDA or the public* An alternative approach may be 
used if such approach satisfies the requirements of the applicable statute, regulations, 
or both. 
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I. INTRODUCTION 

TWs guidance is intended to assist developers of drugs, biological products, and medical devices Intend 
for the treatment of rheumatoid arthritis (RA). The document discusses the types of label claims that car 
considered for such products and provides guidance on the clinical development programs to support tl* 
claims. 

The central purpose of label claims Is to inform prescribes and patients about the documented benefit 
a product Because RA is a chronic, symptomatic disease that can result in a variety of outcomes with 
different chronologies, severities, and overall patient effects, any number of different clinical outcomes c 
provide the basis for a label claim. 

Relief of symptoms the C signs and symptoms claim C Is a central therapeutic effect of most RA 
therapeutics marketed circa 1997. The claim structure proposed in this document, however, incorporate 
wider range of patient outcomes than previously allowable RA claims. As a result, guidance is provided 
demonstrating patient benefit of greater magnitude than Is needed for a claim of symptomatic relief. For 
example, the claims major clinical response, complete clinical response, and remission (the same criteri 
complete clinical response while off alt antirheumatic drugs) reflect enhanced effects on the signs and 
symptoms of disease. The claim prevention of structural damage is documented by various radiograph!* 
techniques. The claim prevention of disability is intended to reflect longer term benefits on disease cour 
The claims and clinical development programs discussed in this draft guidance for industry represent th 
current views of Agency rheumatologists about achievable and clinically relevant overail outcomes that 
be evaluated in clinical trials. 

Traditionally, RA therapeutics have been categorized as disease modifying antirheumatic drugs (DMAR 
or as nonsteroidal anti-inflammatory drugs (NSAIDs). As a result of the ongoing search for more effectiv 
therapeutics that have a positive impact on the natural history of the disease, promising new therapies z 
currently being tested in the clinic. Many of the novel agents under study for the treatment of RA defy 
categorization by putative mechanism of action. As a result, the usefulness of classifying them in the 
traditional manner may be limited. For this reason, information being provided in labeling about the ons« 
and duration of action and the durability of response of therapeutic interventions is intended to reflect th 
data that were gathered in clinical trials. Because of this, some of the claims described In this document 
incorporate response duration times within their structure. 

Over the past decade, there has been a search for better measures to describe patient outcomes in RA 
clinical trials. A number of organizations, including the European League Against Rheumatism, the 
International League Against Rheumatism, the American College of Rheumatology (ACR), and the Outc 
Measures in Rheumatoid Arthritis Clinical Trials (OMERACT) group, have attempted to define core grot 
of measures, as well as composite indices, that describe patient outcomes. As a result of these efforts, 
several new measures have been described and validated with clinical data. With the hope that these 
measures will provide more useful information about patient outcomes, this document provides guidanc 
about the use of these new measures in clinical trials to support label claims. 

One outcome measure that is not fully relied upon as a stand-alone daim is general health-related qual* 
life (HR-QOL). Since RA affects so many domains of a patient's life, it is hoped that such HR-QOL meas 
may provide an integrated assessment of the long-term impact of intervention. However, not enough 
information is avaOable on the performance of general HR-QOL measures in longer term arthritis trials. ' 
incorporation of such measures in planned trials is encouraged. 

Table of Contents 



II. NEW CLAIMS FOR THE TREATMENT OF RA 

A number of new claims are now being evaluated in clinical trials during drug development. Description; 
the claims and acceptable outcome measures to support each claim are discussed in the following secti 

A. Reduction in the Signs and Symptoms of RA 
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This claim is intended to reflect the demonstration of symptomatic benefit or benefits «nMu^ 
jmirwement in signs of disease activity as well as symptoms. Reduction in signs and s ymptoms 
SSffhS claim granted foVmarketing approval Ordinarily, *is claim 
trials of at least six months' duration, unless the product belongs to an already well-dlaractenzed 
dSlmS^SL (e.g., NSAIDs) for which trials of three months' duration are sufficient to 

RA is a disease of long duration. Interventions that provide only short-term, time-limited benefit a 
unlikely to have overall value to patients. In addition, products with the potential to elicit antibody 
formation should be assessed for durability, since antibodies may block effectiveness. In ©valuati 
signs and symptoms, methods that evaluate response over time are preferable to methods that 
incorporate only the baseline value and the final observation, unless there is a reason to weight 
symptoms at the last visit more than intermediary symptoms. Acceptable outcome measures thai 
would support claim A include: 

1 . Validated composite endpolnts or indices of signs and symptoms 

These composites may be used to construct categorical endpoints for patient success or 
failure. For example, the Paulus criteria (1990) or the ACR definition of Improvement (ACF 
2 could be used to assess a patient's response. 

Illustration: Success for each patient in a six-month trial could be defined as meeting the 
criteria for improvement over baseline in at least four of six monthly observations and not 
dropping out because of toxicity 

2. Well-accepted sets of signs/symptoms measures 

The four measures previously recommended in the 1988 ODER Guideline for the Clinical 
Evaluation of Anti-Inflammatory and Antirheumatic Drugs (FDA 1988) C joint counts: pain, 
tenderness, and swelling and global assessments: physician and patient C or the ACR coi 
set are examples of well-accepted sets of signs and symptoms measures. The criteria for 
success and the methods for statistical analysis should be prospectively defined and agret 
upon. For example, historically, in using joint counts and global assessments, a statistical 
significant difference between the control and the treatment group in change from baseline 
at least three of the four measures has been used as the criterion for a successful trial. 
However, as stated above, comparison of only the baseline and last observation may not I 
the best way to construct the analysis since this method leaves out all Intervening efficacy 
observations. 

For both the above measures, using 66 or 28 joint count Is appropriate (Smolen 1995). 

B. Major Clinical Response 

This daim is intended to reflect the demonstration of a continuous six-month period of success b; 
"ACR 70,- which is defined entirely parallel to the ACR 20, except 70 percent improvement, rathe 
than 20 percent is needed for the component assessed, This claim is based on statistically signr 
improvement in response rates by the continuous six-month ACR 70 definition compared to 
background therapy in a randomized control group. For reference, the number of patients satisfyi 
various definitions of ACR responses from ACR 20 to ACR 70 in two historic databases are giver 
Appendix A of this document. Trial duration should be a minimum of seven months for an agent 
expected to have a rapid onset of action and longer for agents with less prompt effects. 



C. Complete Clinical Response 

This claim is intended to describe a therapeutic benefit of greater magnitude than the major clinic 
response claim. Complete clinical response and remission (see below) are identically defined as 
continuous six-month period of both remission by ACR criteria and radiographic arrest (no 
radiographic progression [Larsen 1977] or modified Sharp methods [1985]). Complete clinical 
response connotes a benefit requiring ongoing drug therapy; remission is defined by the same re 
while off all antirheumatic drugs. The 1981 ACR remission criteria (Pinals 1981) require at least f 
of the following: morning stiffness less than 15 minutes, no fatigue, no joint pain by history, no joi 
tenderness or pain on motion, no swelling of joints or tendon sheaths, and erythrocyte sediments 
rate (ESR) less than 20 for males or less than 30 for females. The duration of trials designed to 
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support this claim will vary depending upon the rate of onset of effect of "^'^ 
the most rapid-onset agents, trials of one year's duration should be planned. Longer trla s may bi 
needed for very slow-acting agents. Trials evaluating complete clinical response would use a 
categorical endpoint (patient complete response or treatment failure) as the primary outcome 
measure. 



D. Remission 

This claim is defined as both remission byACR criteria and radiographic arrest (no radiographic 
progression by Larsen or modified Sharp method) over a continuous six-month period while off a 
antirheumatic therapy. Remission is not intended to imply cure, and a remission claim could be 
granted even if patients relapse after six months or more of remission. The duration of trials desij 
to support a remission claim will vary, depending upon the rate of onset of effect of the test prodt 
Ordinarily sudi trials should be at least one year in duration, and longer trials may be needed for 
slow-acting agents. 



E Prevention of Disability 

This claim is intended to encourage long-term trials in RA. Currently, the Health Assessment 
Questionnaire (HAQ) (Fries 1982) and the Arthritis Impact Measure Scales (AIMS) (Meenan 19ft 
are adequately validated measures for use as the primary outcome measure in these trials. Studi 
should be two to five years in duration. Sponsors seeking this claim should plan to have 
demonstrated previously, or to demonstrate concomitantly, improvement in signs and symptoms. 
Since the full effect of RA on a patient is not captured without the use of more general HR-QOL 
measures, a validated measure such as the SF-36 should also be collected and patients should i 
worsen on these measures over the duration of the trial. 



F. Prevention of Structural Damage 

Prevention of structural damage is an important goal of RA therapy. Trials evaluating this outcom 
should be at least one year in duration. 

The following are examples of outcome measures that could be used to support prevention of 
structural damage claims. 

1 . Slowing X-ray progression, using either the Larsen, the modified Sharp, or another validat 
radiographic index 

Radiographic claims should be based on comparisons of films taken at one year (and 
subsequent yearly points) with those taken at baseline. All randomized patients should ha 
films at both time points, regardless of whether they are continuing treatment Patients 
dropping out of the trial should have films taken at that time. Prespecification of the handlh 
dropouts is especially important in these trials. 

2. Prevention of new X-ray erosions C maintaining an erosion-free state or preventing new 
erosions 

Trials evaluating this claim would ordinarHy use a categorical endpoint to assign a status c 
progression or nonprogresslon to each patient, comparing the final state to the baseline st 



3. Other measurement tools (e.g., MRI) 



Other measures, such as MRI (magnetic resonance imaging) or ultrasonography, could b< 
employed. However, because of the technique's potential for identifying small, albeit 
statistically significant changes, the magnitude of the difference that would reflect actual 
patient benefit is unclear and needs to be established. 

Because stowing of radiographic progression does not in itself define a patient benefit, it is expec 
that the claim of prevention of structural damage would be submitted for an agent that has been 
shown (previously or concomitantly) to be effective for one of the other claims (e.g., prevention oi 
disability). However, some agents are not intended to affect acute inflammation, but are designee 
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prevent or slow joint destruction by other means. The first indication that such an agent is clinical 
useful might be slowing of radiographic progression. Nevertheless, the ultimate goals of slowing) 
destruction are to Improve symptoms and to preserve functional ability. Therefore, slowing 
radiographic progression of disease is considered a surrogate marker for overall patient benefit ir 

Under 21 CFR 314, subpart H and 21 CFR 601 subpart E, FDA can approve drugs Intended to tr 
serious and life-threatening diseases based on an effect on a surrogate marker, provided that cei 
criteria are met and that there is a commitment to define the actual clinical benefit of the agent in 
studies completed after marketing. A demonstration of significant slowing of radiographic progres 
In a seriously affected population of RA patients would qualify for consideration under these 
regulations. Sponsors are urged to consult with the relevant FDA staff before embarking on a clir 
program based on these regulations. 

One example of a significant effect on radiographic progression might be the demonstration, in a 
randomized controlled trial, of maintenance of an erosion-free state in a large majority of treated 
patients when control patients develop multiple erosions. The methods of measurement of the 
radiographic finding, the magnitude of change considered to be clinically significant (whether a b; 
patient measure or treatment group means), and the methods of statistical analysis should be 
prospectively defined in the clinical protocol, and the sponsor should seek Agency concurrence v 
the plan for evaluating efficacy. The use of the accelerated approval pathway would necessitate 
timely completion of phase 4 studies using acceptable clinical endpolnts evaluating signs and 
symptoms or prevention of disability. It is anticipated that these investigations would be extensior 
the one-year studies used for the accelerated approval 

Table of Contents 



III. CONSIDERATIONS IN RA PRODUCT DEVELOPMENT 

The following information on preclinical and early clinical product development pertains primarily to 
pharmaceuticals (drugs and biologies). Except in the first two sections, the general principles outlined bt 
also apply to devices. For information specific to the development of devices, refer to the section in this 
document entitled "Special Considerations for Medical Devices," Developers of products that combine 
therapeutic modalities (e.g., biologies and devices) may request assistance from FDA in designating a It 
center for review of the product. Such requests should be submitted to: Office of the Chief Mediator and 
Ombudsman (HF-7), Food and Drug Administration, 5600 Fishers Lane, Rockville, MD 20857. 

Frequent encountered issues in RA product development include; 

1 . Selecting appropriate in vitro (animal or human systems) and in vivo animal models for screening 
potentially active agents. 

2. Designing and performing appropriate preclinical safety studies to support the use of a new mole 
entity in human volunteers or patients. 

3. Balancing the potential need for therapeutic intervention early in the disease course with the nee* 
avoid exposing patients with mild disease to agents that have toxicities or little record of safety. 

4. Identifying the potential risks associated with combination therapies, particularly those with share 
target organ toxicity or potential for pharmacokinetic interactions. 

5. Designing adequate and practical long-term safety monitoring. 

6. Designing trials that definitively show clinical efficacy. 
The following sections discuss approaches to the above issues. 

A. Preclinical Considerations 

This section focuses on preclinical issues that are specific to the clinical development of 
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antirheumatic therapies. In designing toxicity studies and the timing of such **|^ 
L Agency is recommended concerning the current recommendations and V*^noB^flg0 
dw dertses* and biological products. Guidance on preclinlical safety testing, addressing the r 
fS design of Xox\comrtc%Mv* toxicity, genotoxicity and ^^f^^ has 
been developed by the International Conference on the HamK)nization(ICH of Technical 
Requirements for Pharmaceuticals. Because biologies can pose unique challenges in animal stui 
design (for example, specles*pecific binding or immunogenic^ of human proteins in animals), a 
specific ICH document is available that addresses the safety evaluation of biotechnology-derived 
pharmaceuticals (ICH S6 1997) * 

1. Pharmacokinetics. 

Animal studies of drug absorption, distribution, metabolism, and excretion are important di 
the early investigational new drug (IND) phase to aid In toxicity study interpretation, but ne 
not all be completed prior to phase 1 . Generally, for initial studies in humans, determining 
pharmacokinetic (PK) parameters, such as area under the curve (AUC), maximum 
concentration (C^ ), and half-life (t 1/2 ) in animals, is sufficient to provide a basis for pred* 

safe clinical exposure. 

In the past, preclinical testing of combinations of drugs (or biologies) to be used In patients 
RA has not often been done prior to the initial clinical trials. However, given the variety of 
drugs, including NSAtDs, analgesics, corticosteroids, and DMARDs currently used to treat 
patients, it would be useful to consider this testing of common combinations both preclinlc 
and clinically. In addition, to evaluate potential interactions, information on the impact of 
concomitant therapies on pharmacokinetics may be needed to optimize dosing regimens t 
to identify potential safety concerns. Metabolic interactions often may be assessed in an ir 
vitro system using animal or human liver slices, microsomal preparations, or purified P450 
enzymes (FDA 1997). 

Interactions may also result from the presence of Individual- or disease-specific factors, st 
as rheumatoid factor, which may bind to various monoclonal antibody therapeutics; in sucl 
cases In vitro binding studies that identify patients with high titers may be useful in identify 
patients who may exhibit unique pharmacokinetics or patterns of clinical response. 

2. Biological activity 

The biological activity of a potential antirheumatic therapy should be established using mu 
preclinical model systems (i.e., in vitro, in vivo, ex vivo). In vitro screens can use ceils or 
tissues derived from animal or human sources and are generally used to select candidate 
drugs that have a desired effect on a molecular target Such assays can also be used to 
devise appropriate bioassays for the selected agent Animals, either healthy, with rheumat 
disease (spontaneous or induced), or genetically modified, are subsequently used to 
determine whether the biological effect can be demonstrated In vivo. While the in vivo sysl 
used should mimic one or more aspects of rheumatoid arthritis or its etiology, it is expecte 
that each animal model will have limitations. 

a. In vitro 

Data from in vitro studies can be useful in defining the potential mechanism of actio 
a drug or biologic and for determining relevance of a particular animal species for ir 
assessment of activity or safety. These data are especially useful if a potential sum 
marker can be identified in preclinical studies. For example, if the product is intend* 
affect the CD4 receptor on lymphocytes, this receptor can be used as a surrogate 
marker for both activity and certain toxicities. 

Several in vitro tests could be used, depending on the mechanism of action of the c 
or biologic. For example, binding assays may be useful for developing receptor 
antagonists or monoclonal antibodies. In vitro functional assays (e.g., platelet and 
neutrophil aggregation) may be useful tests for identifying inhibitors of inflammatory 
mediators. Enzymatic assays (e.g., in vitro or ex vivo inhibition of cyctooxygenase. 
lipoxygenase, and phospholipase) may also be useful for determining selectivity for 
inhibition of isozymes. 
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b In vivo Selection of animal models should be made on the basis of pharmacodynan 
(PD) responses, similarity of animal disease etiology to clinical disease, and/or to d 
mechanism-based toxicity. Ideally, products that are targeted for a subset of arthnti 
patients should be developed in an experimental model(s) that is most relevant to tl 
subset. For example, rats are not sensitive to drugs that inhibit 5-lipoxygenase. 
Therefore, mouse or rabbit models are more relevant to evaluate the antWnflammat 
activity of leukotriene inhibitors. 

The development of rheumatic disease models to allow screening for potential RA candid* 
drugs is encouraged. The following examples are meant only to illustrate some models in 
current use and are not intended to suggest excluding the use of others. 

Collagen-induced arthritis (CIA): 

Collagen-inducted arthritis is often considered to be a suitable model for studying potentia 
drugs or biologies active In human rheumatoid arthritis because of the involvement of loca 
major histocompatibility, complete class ll-restricted T helper cell activation, and the simlla 
of histopathological lesions. Radiographs of joints affected by CIA often show erosive chai 
similar to those seen in human rheumatoid arthritis. The progressive arthritis often results 
RA-like joint deformity and dysfunction. Anticollagen antibodies, which occur in some pati€ 
with RA, develop fn the CIA model. 

The CIA model has been useful for assessing immunosuppressants and steroid hormones 
well as inhibitors of inflammatory mediators. Since this model can be induced In several at 
species, it may be especially useful for evaluating drugs that are species-specific (e.g., 
leukotriene antagonists and 5-lipoxygenase inhibitors). In addition, although functional tesr 
are not routinely used in this model, incorporation of measures of mobility and joint functlo 
may enhance the predictive value of the model. 

Naturally occurring arthritis or autoimmune response: 

MRL/lpr mice, Biozzi H mice and DBA/1 mice have been used to examine the onset of dru 
induced tolerance and Immunosuppressant drug effects on autoimmunity. The MRL/lpr mc 
model has been useful for evaluating immunosuppressants and hormones. 

Rat carrageenln-induccd acute model of inflammation: 

This model has been useful in assessing anti-inflammatory activity of cyclooxygenase 
inhibitors. Most of the animal models that involve inflammation in the paw may be used for 
measuring antiphlogistic action of a drug. 

Adjuvant-induced arthritis in rats (AA): 

AA in rats has been frequently used for screening nonsteroidal anti-inflammatory drugs an 
inhibitors of inflammatory cytokines as well as antimetaboliteJike immunosuppressants. 

Streptococcal cell wall-induced arthritis: 

This model has been used for developing cytokine inhibitors. 

Experimental organ transplant in animals: 

This model has been used to identify the activity of immunosuppressants and antlmetaboli 
particularly those directed at cytolytic cellular immune processes. 

Transgenic animal models: 

A number of transgenic animal models are being developed for the study of rheumatoid 
arthritis and may prove useful over the next decade. Some examples include transgenic nr 
that carry genes for the env-Px region of the human T cell leukemia virus type I genome, 
human TNF, CD4, and HLA B-27. 
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3. Toxicology 

Preclinical toxicology studies of a drug or biological product are designed to ^aracterize 
general and specific toxicity using dosing routes and regimens as similar as posstole to th. 
proposed clinical trials with consideration of the demographics ^nd I disease sta us of the 
Wended patient population. For instance, the prevalence of RA is high in females. Theref 
reproduces toxicity studies should be completed early in clinical development to support * 
Inclusion of women of childbearing age in early phases of clinical triate. The need for 
reproductive studies for biological products is likely to be case-speclflc due to cx>mpHcatior 
arising from immunogenlcity and species selectivity. Therefore, standardized study design 
such as those recommended in the ICH reproductive toxicology guidance, may not be fea 
or clinically relevant for biologies (ICH S5 1994). The need, and specific designs, for these 
studies may be discussed with Agency review staff. 

Immunomodulatory or immunosuppressive agents administered to RA patients as 
monotherapy or in combination raise concerns about the adverse effects of prolonged 
Immunosuppression. For example, malignancies (I.e., lymphomas) are a known risk of Ion 
term, nonselective immunosuppression used for treatment of graft recipients. Investigate 
drug-related opportunistic infections and mortality related to immunosuppression have 
occurred in RA patients. Sponsors are encouraged to identify and use animal models that 
assist in selecting drug candidates that selectively inhibit cells and processes responsible 
RA. 

Antirheumatic drugs are often used in combination in an attempt to improve outcomes anc 
minimize toxicities. However, drug interactions may result in increased toxicity, even at lov 
than previously evaluated doses of either agent This concern is especially evident for age 
that have long half-lives or nonselective activity, or for drugs that share common target orj 
toxicity. Preclinical toxicity studies that evaluate the use of combined agents may be helpft 
predicting clinical safety hazards. The duration of toxicity dosing of animals is usually linke 
patient dosing regimens. Development and validation of in vitro or whole animal models is 
encouraged to address concerns regarding short- or long-term toxicity and to identify surr< 
markers for patient immunocompetence. 



B. Pharmacokinetic/Pharmacodynamic Strategies 

In vivo pharmacokinetic studies should be used to evaluate drug disposition and metabolism, dej 
of linearity and accumulation, dose proportionality, and, for oral dosage forms, food interactions ( 
1 992). Some of these data can be gathered in a single study designed to evaluate a number of 
parameters. During formulation development, bioequivalence studies linking formulations may be 
recommended. 

A particular concern with biological agents is the development of antibodies that may accelerate ■ 
clearance or alter Its distribution, resulting in changes in therapeutic benefit over time, or followim 
repeated courses of treatment. To address this consideration, it is desirable for sponsors to build 
their repeat-dose clinica! protocols a coordinated evaluation of drug levels, receptor saturation, 
antidrug antibodies, and clinical responses. Optimally, these assessments would be conducted a 
initiation of therapy and at several time points over the course of therapy. The presence of antldn 
anObodies and their role in altering drug exposure, clinical activity endpoints, or adverse events v 
be evaluated. The goal of an integrated analysis of these parameters is to provide data to guide < 
dosage or schedule changes to optimize therapeutic benefit. The best time for conducting these 
pharmacokinetic studies is prior to phase 3, before commitments have been made regarding dos 
and schedule. 

Because polypharmacy Is common during the treatment of rheumatic disorders, In vitro binding 
studies with blood from patients with active disease should be used as a preliminary screening tc 
for potential displacement reactions. 

For products that may Interact with rheumatoid factors (e.g.. monoclonal antibodies), the frequen 
patients with rheumatoid factor reactive to the antibody, as well as the effect of interactions on th> 
pharmacokinetics of the product, should be evaluated. 



C. Considerations in Phase 1 Trials 
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For general information on clinical development pertaining to most drugs and ^U^Jgg^ 
the CDER guidance General Considerations forihe Clinical Evaluation of Drugs (FDA 1978). 

The term phase 1 has two connotations: one refers to the earliest, fir^-time-into-humans trials, w 
the other encompasses studies of pharmacokinetics, metabolism, drug interacts, special 
populations, and the other clinical pharmacology trials as described above, t is ex^^atbol 
kinds of phase 1 trials ordinarily wiU be conducted dunng the d.mcal evaluation of therapies for R 
This section is primarily intended to discuss issues related to the first time people are exposed to 
drug {including to a particular dose level, combination, or duration of therapy). 



1. Settings and Investigators 

Firet-time-into-humans phase 1 studies should be carried out in institutions with a full rang 
clinical and laboratory facilities, and the patients should be kept under dose observation. I 
desirable that the trials be under the direction of physicians with expenence in early drug 
development and rheumatology, or that a team of investigators combining expenence in 
rheumatology and clinical pharmacology be employed. 

2 Subjects Rrst-time-into-humans drug trials are frequently conducted In healthy volunteers 
Such studies are predicated upon the ability to perform and to interpret the results of pred 
animal tests. If the predinical testing does not reveal potential mutagenic or immune syste 
effects or potentially serious effects at or near the expected therapeutic range, testing In 
volunteers may be initiated. However, for products that have potentially serious toxicities, i 
may be appropriate for initial testing to be performed In patients with some potential to ber 
This has created challenges in selecting an appropriate initial patient population. 

For products that have been tested in relevant predinical toxicity evaluations and have be* 
found relatively safe (without the potential for mutagenic, immune system or other senous 
effects at the proposed doses), trials may be initiated in healthy volunteers. However, if 
significant effects have been demonstrated or might be possible, selection of an appropria 
patient population is necessary. It is recommended that patients meet the ACR criteria for 
diagnosis and activity of RA and be without other serious medical conditions. Patients with 
minimal disease are sometimes not appropriate for the same reasons that the testing is nc 
initiated in healthy volunteers. Patients with devastating RA may also not be the best start 
population because of the medical complications of their disease. In addition, they may be 
likely to respond to therapy. 

There is ongoing epidemiologic work on identifying markers of increased risk in RA. Thest 
could be useful for identifying patients with poor prognoses, who might be considered for \ 
aggressive treatments of potential high toxicity (e.g. f immunoablative therapies followed b) 
stem cell transplants). Application of epidemiologic studies may allow a very aggressive 
treatment to be restricted to a subset of RA patients who have a demonstrated shortened 
span due to their disease (e.g., subjects with greater than 30 affected Joints or a HAQ scoi 
with fewer than 75 percent of questions answered "without difficulty"). 

When the characteristics of the agent suggest that it may potentially have long-term gonac 
effects, it is desirable that men and women not wishing to parent children be chosen for pf 
1 studies. 



3. Trial design 

Ordinarily, Initial phase 1 studies are sequential dose escalation trials, in which safety and 
tolerance at a specific dose is established before exposing additional subjects to a higher 
dose. A single dose is almost always tested first, followed by repeated dose studies; howe 
this design is influenced by the type of agent used. Although escalating the dosage to a d« 
determined maximum-tolerated-dose (MTD) will aid future trial design, in some instances i 
not medically prudent to try to fully charaderlze the MTD. Additionally, for some products, 
MTD may be undefinabte. 

The starting dose chosen is often a no-adverse-effect dose (determined by interspecies 
milligrams (mg)/meter square/day dose conversion from animal to human). For biologies, I 
initial dose chosen is often one thought to have no adverse biological effect, with caution 
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regarding the possibBity of relative species specificity and comparing receptor avidity betw 
testspecies and humans. Conservative dose escalations (e.g. t half tog or less) are usually 
recommended. 



4. Concomitant therapy 

Use of low-dose corticosteroids (up to 10 mg prednisone equivalent daily) and NSAIDS m 
ordinarily be continued in phase 1 trials. Concomitant therapy with methotrexate and simil; 
agents should be avoided in initial phase 1 trials of all novel antirheumatic drugs, biologies 
and devices because of the difficulty of differentiating the toxicity of the novel agent from tl 
of the co-administered product. 

Physicians now prescribe methotrexate and similar agents earlier in the course of rheuma 
arthritis. Recruiting adequate numbers of patients not taking these agents may be difficult 
Approaches that may allow the use of methotrexate and similar agents in later phase 1 trfe 
include (a) obtaining reassuring evidence of lade of toxicity from relevant animal models in 
which co-administration occurred and (b) starting at doses significantly lower than the no- 
adverse-effect level of the single agent as determined by earlier phase 1 studies or preclin 
studies, or both. Such proposals should be discussed in the planning stages with Agency : 



5. Observations 



a. Safety 

The standard batteries of safety observations have been described elsewhere (ICH 
1994). However, additional types of safety observations may be necessary (e.g., te 
of effects on cellular and humoral immune function or host defenses). For products 
the potential for effects lasting long after administration, or for delayed toxicity, 
appropriate follow-up should be designed. For example, phase 1 studies of agents 
to deplete or modify the function of T-cell subsets should be designed to carefully 
assess both the short- and long-term effects on number and functional status (e.g., 
delayed type hypersensitivity responses) of cell populations and other pertinent 
pharmacodynamic assays during therapy and during follow-up. 

It is also desirable to incorporate individual patient adverse event stopping/Withdrav 
rules into protocol designs. In addition, it is often advisable to incorporate into trial 
designs rules for triad stopping or trial modification In case adverse events are obse 
For example, dose escalation rules should be clearly defined In dose-finding studie: 
with provisions for enrollment of additional patients at or below the dose-causing to: 
if possible significant adverse events are observed. 

It is desirable to develop a standardized toxicity grading scale for use in ail trials of 
product based on the known and suspected toxicities of the product or of the drug c 
This may improve consistency of adverse event reporting and allow more accurate 
comparisons among trials. 

b. Efficacy 



Developing an understanding of the agenfs therapeutic potential in early trials, is hi* 
desirable for efficient product development. This may be attempted in phase 1 , but 
ordinarily be achieved only by performing controlled trials. RA responses In open tri 
are of little value in indicating efficacy. Consideration should be given to the more 
modest goal of determining whether the pharmacological effect predicted from the 
preclinical development is present (proof of concept). 



D, Considerations in Phase 2 Trials 



During phase 2, larger, often longer, term trials are employed to better define the dose- and 
exposure-related activity and toxicity of the agent. Enough information should be generated to en 
that the phase 3 trials can be conducted safely and with a probability of success. In addition, pha 
trials should solidify a total drug development strategy to ensure that, after the phase 3 safety/effi 
trials are done, all of the Information needed for registration will have been gathered, including ar 
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appropriate safety database, clinical pharmacology, dose-response data, the exploration In sped 
populations (e.g.. renal failure, hepatic failure, pediatric patients), and information on drug Interac 
with agents expected to be co-administered. 

There is nothing to predude conducting additional phase 1 clinical pharmacology studies and phi 
trials while the phase 3 development is ongoing. 

The following issues are important for phase 2 trials in RA: 

1. Trial design 

Dose finding Is a central challenge of phase 2 development. Once a reasonably safe rang< 
doses has been established, randomized, parallel-arm dose-comparison trials are ordinari 
recommended. The use of a placebo arm is desirable for several reasons. First, if no 
difference is found among doses, there is usually no other way to determine whether all d< 
were equally effective or equally ineffective. Second, if a dose-response trend Is found, the 
placebo arm may indicate the possible magnitude of the observed effect If use of a placet 
Impossible, designs should include wide dose ranges or durations, or repetitions- Active 
controlled designs that specify an arm with a well-characterized, known therapy can also k 
very useful. 

Signs and symptoms measures may be used for dose-finding studies, but it Is possible the 
separate dose-finding studies may be needed for longer term endpoints. 

For agents that are thought to have prompt onset and rapid offset of effect, alternative des 
including crossover and titration designs, may be useful, although historically this has not I 
the case. Trials of two or more doses that permit liberal titrating per the patients* response 
unlikely to clearly demonstrate a dose response because these titration designs result in e 
blurring of any existing dose distinction. 

The desirability of identifying a range of doses with acceptable toxicity and reasonable act 
for study in phase 3 cannot be stressed enough. 

2. Safety 

Every RA investigational therapy raises safety concerns. Whenever there is a potential for 
significant toxicities, long-lasting or delayed-onset, it is desirable to design the phase 2 stu 
to provide a group of patients with longer term follow-up preceding the larger phase 3 stud 
Provisions for long-term follow-up can be helpful in addressing issues raised during prem? 
review (e.g., potential for Immunosuppression, opportunistic infections, neoplasia, and 
induction of autoimmune disease). Standard toxicity grading scales and stopping rules are 
desirable in phase 2. 

3. Additional development aspects 

a. Concomitant therapy 

Before starting phase 3 trials, an evaluation of the test product's interaction with oth 
agents likely to be used by the target population should be performed, initial inform; 
can be established based on metabolic pathways, studies of in vitro systems, anim; 
human pharmacology studies, or drug interaction studies. This type of information r 
in directing areas in need of clinical evaluation. When products are intended to be 
tested as combination therapy with the investigational agent, substantial informatioi 
interactions and safety of coadministration should be developed in phase 2. 

b. Gender effects 

Most RA trials have predominantly female enrollment. Sponsors should evaluate 
whether the observed safety and efficacy findings are restricted to women or can be 
also extrapolated to male subjects. This may be accomplished by subset analyses 1 
trials, PK data, or other information (FDA 1993). 
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E. Efficacy Trial Considerations 




desired laneiing ciaims. «ny auumuimi H ifl,,i,w » 
generate data supporting the claim should be identified. 



1. Global considerations 
a. Patient selection 

Activity of dbease: Unless some other specific subgroup is targeted, patients enroll 
efficacy trials should at a minimum meet the disease definition and disease activity 
defined by ACR criteria. Consultation with the Agency on the generalizability of claii 
derived from trials with significant limitations on entry criteria is recommended. 

To enhance the power of the trial, strategies to improve the chances of a response 
therapy are often employed. Some designs incorporate an attempt to select active 
patients by withdrawing background treatment and allowing patients to flare. Only 
individuals with sufficiently high scores are enrolled. The relevance of this type of 
observed flare is questionable, and its ability to predict the normal course of active 
disease has not been established. Many patients randomized to placebo in such sti 
exhibit the characteristic response of rapidly returning almost to baseline without fui 
treatment. In addition, when patients undergo blinded withdrawal from therapy withi 
these trials, similar dramatic flares are not observed. This raises the question of wh 
there is an expectation bias on the part of patients, who have been told about the ft 
procedure, and ascertainment bias on the part of investigators, who wish to have 
patients meet the entry criteria and enroll in the study. These uncertainties and 
instabilities around the outcome measures used in such trials should be kept In min 
when employing these designs. 

A proportionately smaller, but nevertheless noticeable and prompt, regression to th 
mean is noted in the joint scores of patients required to have a certain minimum val 
for trial entry in trials not employing a flare strategy. This means that patients, on th 
whole, will not actually have disease as active as anticipated when the entry enteric 
set 

Subgrouping patients by disease markers: RA is likely composed of a number of nrn 
or less distinct diseases delineated by a common genetic background, correspondii 
clinical manifestations, similar serologies, and responses to therapy and prognoses 
The study of RA possibly may be made more efficient with the use of markers with . 
prognostic significance as entry criteria to Increase patient homogeneity. Novel 
epidemiologic and molecular genetic approaches may lead to identification of even 
more subgroups. However, prospective studies are first needed to confirm the clink 
usefulness of new purported prognostic factors. Where existing data do support 
markers as prognostic indicators (risk factors), such as the presence of rheumatoid 
factor, erosive or vasculitic disease, and DR4 homozygosity, they can be taken into 
consideration in the design of trials, as can factors known to affect treatment respor 
Although in some cases such studies could limit generalizability and impact labelinc 
the final product, it is also possible that such targeting could improve the rlsk/benefi 
profile. 

b. Concomitant antirheumatic therapy 

Studies in RA patients, except In those with very mild disease, are carried out in th€ 
presence of concurrent active therapies, including steroids, NSAIDS, 
hydroxychloroquine, etc. This concurrent therapy creates numerous challenges in 
patient selection, toxicity monitoring, and clinical trial design. For example, since 
methotrexate therapy is used to treat many RA patients, it Is inevitable that new ag< 
wBI be used in combination with methotrexate in clinical practice unless a 
contraindication exists. Therefore, unless a prohibition on concurrent methotrexate 
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supportable, data regarding use of the investigational agent in combination with 
methotrexate are needed to evaluate the potential for immunosuppression from 
combination therapy. Other agents should be similarly evaluated. 

In addition, patients can be categorized according to their prior responses to stands 
therapy. Varying trial designs may help assess the response of different response 
categories to an investigational therapy. For example, with respect to methotrexate 
the RA population can be divided into five groups: (1 ) methotrexate noncandidates 
disease too mild or too early for methotrexate; (2) methotrexate candidates C disea 
sufficiently (or wilt become sufficiently) active to justify methotrexate; (3) methotrex? 
successes C disease reduced to negligible amounts; (4) methotrexate failures C cU 
drug failures, for inefficacy or tolerability, and (5) methotrexate partial responders C 
considerable residual disease despite methotrexate. Each of these groups might be 
considered separately for candidacy for an investigational agent and with respect tc 
appropriate trial design. If only a subpopulation of RA patients (e.g. methotrexate 
nonresponders) is studied in a particular trial, the results strictly reflect efficacy in th 
group only, but they may, of course, imply something about efficacy in other groups 
Single trials in various responder subpopulations could be supported by positive res 
in other subpopulations. Any planned subpopulations should be clinically 
distinguishable. Sponsors should consult Agency personnel when planning a clinic* 
development program contemplating an RA claim that Is limited to a subpopulation 
the disease. 

c. Other concomitant therapies 

Most patients with RA are taking concomitant medications. Use of medicines unlike 
influence treatment outcomes (e.g., antihypertensives) should simply be recorded, 
although Investigators should be alert for possible drug interactions. Obtaining 
information during clinical development on co-administration of the test medication 
expected concomitant medications is desirable. The following approaches may be 
considered in dealing with arthritis medications or analgesics. 

Prohibit use: This strategy may result in noncompliance or an increased number of 
dropouts. 

Incorporate protocol-specified use, with monitoring: With this strategy, additional 
analgesic use (and possibly other arthritis medications) may be used according to 
protocol-specified criteria. In addition, for long duration studies, protocols should 
address (1 ) whether intraarticular steroids are permitted and, if so, for how long 
assessments of the involved joint are excluded from analysis; (2) the manner in whi 
stress doses of corticosteroids for surgery, etc., are to be handled; and (3) how soo 
after such doses protocol assessments would be allowed. 

Design analgesic use, or its quantitative consumption, as (part of) an efficacy endp* 

Define use of more arthritis treatments as (part of) an efficacy endpoint. 

d. Stratification 

Randomization is intended to balance populations for confounding variables; howe\ 
there is always a chance that randomization may fail to achieve balance, particular! 
smaller trials. It may be advisable to stratify known (or highly suspected) major risk 
factors to ensure their balance across arms. Any factor whose influence on the outc 
is suspected to be as strong as the treatment's influence should be considered for 
stratification (e.g., erosive disease, presence of rheumatoid factor). An often overlo- 
risk factor is the patient's past therapeutic history. (See statistical section for further 
discussion.) 

e. Blinding 

Because most RA outcome measures have a high degree of subjectivity, the hlghe: 
confidentiality in patient and assessor blinding should be sought to achieve a credit 
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inferences-Blinding may be compromised if there is not an approximate parallelism i 
time to onset, nature of response, and toxicity profile between active and controlled 
interactions. Trials should have parallel <e.g. f "double dummy") dosing in all arms w 
possible so that a drug requiring frequent dose manipulations does not threaten the 
blind. If arm specific treatment adjustments are necessary (e.g.. per monitored dai£ 
levels), these can be done by an unblinded (and sequestered) third party to maintai 
patient and assessor blinding. In this case, parallel changes should be made as dui 
adjustments in the control arm to preserve blinding. Similarly, if the blind is likely to 
compromised by infusion-related events or other features of the treatment protocol, 
critical treatment endpolnts, such as joint counts, should be assessed by an 
independent party with no knowledge of the subject's history. 

f. Effects of dropouts and noncompliance. 

It is important that trials be designed to minimize dropouts and the attendant inform 
loss. Traditionally, recommended RA trial designs have focused on eliminating sow 
of variability, for example, extra pain medications, and intra-articular injections. Oftc 
these interventions were defined as major protocol violations, requiring that the pati 
be dropped from the study. There is a trade-off between patient retention and toieic 
of variability in RA trial design. Protocols demanding rigid adherence may yield 
uninterpretable results because of dropouts and noncompliance emanating from pa 
and investigator intolerance of the requirements. On the other hand, protocols 
permitting any kind of additional intervention may likewise be so confounded as to c 
interpretation. 

The following strategies may help minimize loss of information: 

i. Use screening or run-in periods so that patients are randomized to treatment 
groups only after their eligibility and commitment are confirmed. 

ii. Thoroughly train investigators and study personnel to minimize inappropriate 
enrollments, protocol violations, and other deviations that would decrease th 
ability to assess trial outcomes. 

Hi. Include dropouts in the definition of the endpoint, as in a time to defined 
treatment failure, or a defined by-patient success or failure. A sufficiently late 
time point should be chosen as the endpoint to avoid the situation where the 
course for response differs between two active therapies in a trial, which can 
introduce bias. 

One example of this approach would be to use a protocol-defined response t 
as the primary endpoint, wherein dropouts due to lack of efficacy are classify 
as nonresponders. With this type of endpoint, the criteria for classification as 
nonresponder should be clearly and prospectively defined. The use of this ty 
endpoint could be justified in situations where there are robust phase 2 data 
suggesting drug responsiveness at a defined point after initiation of therapy i 
durability of that response. In this case, one could define the primary analyst 
a comparison of the proportion of patients with an ACR 20 response at six 
months. The protocol could specify that if no improvement compared to base 
were seen on two consecutive study visits after two months on protocol, the 
subject would be declared a nonresponder. Experience will determine wheth 
this approach effectively limits information loss due to dropouts. 

iv. Make provisions for following patients who have stopped experimental treatr 
Options include allowing a protocol-specified crossover to a standard therap; 
patients meeting predefined criteria for treatment failure. 

v. Allow more flexibility in treatment options during the study. Some designs thi 
have been used include allowing dose adjustment of the comparator arm 
(assessor and patient blinded), allowing add-on therapy for patients meeting 
predefined criteria for inadequate response, and allowing a limited number o 
joint injections, with elimination of that joint from assessment. 
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2. Trial designs in RA 

Clinical trials in RA can be designed in a variety of ways. More than one claim can be purs 
in the same trial, and claims can be submitted singly or together. Trials can be designed tc 
a difference C demonstrating that the investigational product is superior to control (placebi 
lower test dose, another active agent), or they can be designed to test no difference C 
demonstrating that the product is adequately similar in efficacy to active control. Placebo, 
dose, concentration- or active-controlled designs can be used. 

Because the persuasiveness of trials showing a difference is, in general, much greater tha 
that of equivalence trials, it is highly desirable for a claim to be convincingly demonstrated 
least one trial showing superiority of the test agent over placebo or active control. If a clair 
superiority over a specific comparator is sought, rather than just straightforward efficacy, tt 
claim should be substantiated by two adequate and well-controlled trials showing superior 
Such trials can also be the basis for demonstrating the product's efficacy. 

a. Superiority trials 

The standard two-arm, investigational agent versus placebo design has been the rr 
common RA design and is the most straightforward. The details of trial design will 
depend on the population tested. Patients with mildly active RA taking only NSAiDS 
who have never been treated with an additional class of therapy, may be enrolled ir 
placebo-controlled trial with continuation of NSA1D background therapy; however, 
patients doing poorly on NSAIDs alone are usually not appropriate candidates for 
placebo-controlled trials. The same considerations apply to patients who are partial 
responders to, or who have failed, various other treatments. 

Alternatives to the two-arm difference design are a standard dose-response study s 
superior-to-active-control hypothesis. These designs may accommodate the need t 
provide active treatment to patient groups where randomization to placebo is infeas 

b. Equivalence trials 

Equivalence trials are designed to demonstrate that the test drug is adequately sim 
to an active control. This is done using a prospectively defined equivalence test, 
specifying to a 95 percent confidence level that the real difference between test anc 
control is smaller than some predetermined amount Achieving similar point estimat 
efficacy of two agents is not a demonstration of equivalence. 

Equivalence trials can serve two purposes. First, they can be used to supply evider 
for a simple efficacy claim. Second, they can be used to pursue a specific equivaler 
to drug X claim. Both purposes could also be pursued in the same trial. The imports 
point to note is that the strength of the evidence may need to be stronger for a 
claim of equivalence to drug X than for a simple efficacy claim. Thus, the 
equivalence test may differ, depending on which claim is Intended. Currently, the 
equivalence standard that is appropriate for a given trial in RA will be determined oi 
case-by-case basis. As noted above, this test may be more stringent if a claim of 
equivalence to drug X is being pursued. Additionally, the test of equivalence may bi 
constructed differently if a placebo arm is present, since the presence of the placeb 
allows estimates of absolute and relative drug effect size. 

In either case, the statistical test for equivalency needs to be quantitatively describe 
the protocol. Under either the pursuit of a simple efficacy claim or the pursuit of a 
specific equivalence to drug X claim, the basis of the decision on an appropriate tes 
remains, fundamentally, a clinical one. It represents a consensus, in that particular 
circumstance and for that particular claim, on what small potential difference can be 
considered clinically insignificant, to allow the treatments to be considered clinically 
equivalent 

There is considerable experience in the interpretation of active-controlled trials in c) 
situations where the response to the intervention is high. As an example, antibiotics 
evaluated by the Division of Anti-Infective Drug Products (CDER, FDA). For these 
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products, the magnitude of the potential difference permitted in an equivalence 
determination depends on the response rate of the standard treatment. For exampi 
new agent being compared to standards with response rates of 90 percent or more 
be allowed a 10 percent window to provide confidence that the difference between * 
standard response rate and the test response rate is no more than 10 percent. 
Technically, this means the 95 percent confidence interval of the difference must lie 
to the right of 10 percent If the standard treatment Is known to have an 80 to 90 pe 
response rate, a 15 percent window is used. These equivalence tests were design* 
trials without a placebo arm and for clinical situations where the placebo response i 
known to be very low. 

Treatment response rates in RA are often in the vicinity of only 50 percent (dependi 
of course, on the endpoint used) with placebo rates of about 20 percent, so the din 
decision for an allowable small difference may differ from that with antibiotic therapy 
this time, the decision will have to be individualized for each trial. 

A major problem in equivalency trials lacking a placebo arm Is ensuring that both 
treatments are equally effective, rather than equally ineffective. A number of the ag< 
that are approved for RA have fairly small effects and may fail to show efficacy whe 
tested against a placebo. Comparative trials intended to show equivalence to such 
treatments, when not anchored by a placebo control group, may lack credibility. Thi 
Is desirable in equivalence designs to select highly effective comparative agents. If 
possible, use of a third (placebo or lower dose) arm, so that a treatment difference < 
be shown, is a desirable strategy in equivalence trials. This arm would not necessa 
have as many patients or as long a duration as the active comparators. If a placebc 
is present, both the test and active arms need to statistically exceed placebo for a 
finding of equivalence to have meaning. 

Strict attention to numerous aspects of trial design and conduct are important to en: 
accurate inferences from equivalency trials. Design decisions regarding patient 
population, dosing, and efficacy and safety assessments should be done in a mann 
that fs unbiased against the control to ensure a fair comparison. Furthermore, atten 
to certain problems in trial conduct, such as minimizing dropouts, noncompliance, a 
missing data is essential to the reliability of the inference. These aspects of trial cor 
may obscure differences and lead to a false conclusion of equivalence. This is the 
opposite of their effect in a difference design to show superiority, where they work 
against trial success. 

In any particular development, the choice of trial design depends on many factors. 
Since controlled evidence showing a difference is more persuasive than that showii 
equivalence, greater efficiency (fewer patients or shorter exposures) is available wii 
development strategies using trials employing maximal differences between trial an 
Optimally, this means placebo controls, with the requisite background therapy, give 
all patients. 

c. Trial designs novel to the study of RA 

Although not used traditionally In the study of new RA treatments, the withdrawal d< 
can be considered in certain circumstances. The withdrawal design is sometimes u 
to assess efficacy. In this design, patients in both arms of a study are treated with tl 
investigational agent, which fs then blindly withdrawn from one arm, after which pat 
outcomes are compared. Showing that patients taken off the investigational drug gc 
worse demonstrates effectiveness. Natural endpoints for withdrawal designs are tin 
(predefined) worsening using standard time-to-occurrence statistical tests or a simp 
comparison of proportion of outcomes in the two arms. Withdrawal studies may be 
performed with both arms on background therapy. 

There are a number of caveats about withdrawal designs. If the product is very toxi> 
that only a small (tolerant) subset of the original population remains at the end of th 
trial and is available for the double-blind withdrawal phase, the generalization of an; 
inference from the withdrawal design is limited to that tolerant subset. Additionally, i 
should be noted that, if a drug Induces habituation or tolerance, withdrawal or rebot 
phenomena may make withdrawn patients worse even though drug therapy did not 
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have a beneficial effect. 
3, Anaiyticalissues 

a. Handling dropouts 

Historically RA trials have suffered from Information loss due to dropouts. Dropoufc 
^ES&Sr ooour randomly, and rarely occur fully "-j^ 
beina tested so there Is always the possibility that dropouts introduce a bias. This 
Sbffi^ many mndomtod trials. Methods for analyzing the effects of 

dropouts have been proposed, but none is fully adequate. 

The problem of dropouts is not resolved by using an intent-to-treat (i.e., all randoml 
oaflents included) analysis with an imputation by last-observation-carned-forward 
OTT/LOCF) or by showing that both the flT/LOCF and PP/OC (per protocol 
completers/observed cases only) analyses concur, although these approaches imq 
increase confidence in the results. It should also be noted that °* er ™ eth 
of modeling missing data, for example, see Little and Rubin <1987). Such modeling 
methods require assumptions that are nonverifiabie by existing data. 

The effects of dropouts should be addressed in all trial analyses to demonstrate tte 
conclusion Is robust. One trial design approach is following all patients, including 
dropouts to the planned trial endpoint, even if postdropout information is confound* 
new therapy, and performing an analysis including these patients. Another approac 
involves the worst case rule: assigning the best possible score to all postdropout 
placebo patients and the worst score to all postdropout treatment patients, then 
performing an analysis including these scores. 

b. Comparison to baseline outcome measures 

A phenomenon frequently observed in RA, as well as in other conditions, is that pat 
who stay in trials do better than those who drop out responders do better than 
nonresponders. This is true both for placebo groups and active treatment groups. If 
observations of the disease were made exclusively from clinical trials, one might 
conclude that the natural history of the disease is inexorable improvement This 
phenomenon is attributable to preferential dropout of worsening-patients (a 
phenomenon not adequately compensated for in LOCF analysts) as well as regresi 
to the mean. The problem is exacerbated in flare designs, where all patients have r 
improvement regardless of treatment status. This fact makes comparison-to-baselir 
outcome measures difficult to assess, since, very often, much of the improvement r 
has no relationship to a treatment effect For these reasons, active-controlled tnals 
incorporating a piacebo arm and using comparisons to baseline may be extremely 
difficult to interpret, especially if a flare design is employed. In any case, success in 
trial implies improvement over control. 

4, Statistical Considerations in Efficacy Trial Design 

It is advisable to discuss the design and analysis with the FDA review team prior to embar 
on a study. In addition, FDA's Guideline for Format and Content of the Clinical and Statist! 
Sections of New Drug Applications (1 988) contains useful information. 

a. Randomization/stratification 

Randomization is intended to allocate patients to treatment groups to avoid bias an 
ensure that statistical procedures can be appropriately applied. 

In some clinical trials, there are known factors that are at least as influential in 
controlling the observed severity of disease as the drugs being studied. Strattficatio 
may be used to avoid relying on randomization properties to balance patient assign 
for these factors. Stratification Is implemented by constraining simple randomizatior 
balance the assignment of patients to treatment groups for the chosen stratification 
factors. 
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Every phase 2 and phase 3 study protocol should contain *™*°^™*^™ n 
SSLirts imposed on the randomization should be explicitly identified. It can then 
inferred, when a stratification factor or sample size allocation constraint is not 
mentioned in a protocol, that there exists no corresponding randomization constra ir 
This applies to whether patients are blocked to balance treatment assignment for to 
of patient entiy into study and to the more obvious stratifications on center and bast 

Because stratification implies constraints on randomization, studies that have been 
stratified for certain fector(s) should account for these factors in the statistical analy 
section. The protocol-defined analysis should be implemented for each study. 

There are also statistical procedures to address bias In treatment group comparisoi 
adjusting for factors (covariates) that, like the stratification factors, are to be prespe 
in the protocol or by using a mechanism to determine a fixed number of covariates 
prespedfied. It Is important to prospectively identify covariates (or criteria for choos 
covariates) in the protocol In addition, statistical adjustment procedures should be 
with care and include appropriate verifications of assumptions. 

In deciding whether to stratify randomization In all clinical trials, practical judgment i 
required. There are reasons to choose stratification and reasons to choose statistic 
adjustments. 

The first advantage of stratification is that it avoids possibly major statistical adjustn 
of differential treatment effects. Stratification would essentially eliminate the effect c 
such adjustments before analysis began. Second, although stratification and statist 
adjustment procedures are both designed to remove bias in estimated treatment efl 
stratification is usually more powerful. This is because stratification leads to smaller 
variances of estimated treatment effects than does statistical adjustment without 
stratification. Finally, the inclusion of stratification factors into a statistical analysis n 
should result In increased power to detect effectiveness. 

Stratification becomes increasingly clumsy as the number of strata increases and, 
consequently, the available number of randomizable patients per cell decreases. In 
case, it is logistically simpler to not stratify, but to rely on statistical methods to adju 
these factors. 

The best approach may be to combine stratification, applied to a limited number of 
most influential prognostic factors, with statistical modeling. This protocol-defined 
statistical modeling would both account for stratification and be used to adjust for th 
effects of a parsimonious number of the most important remaining factors 

b. Identification of primary efficacy variables 

Each phase 2 or phase 3 study protocol should identify the primary and secondary 
efficacy variables. Primary efficacy variables are critical to the identification of the 
effectiveness of the product. Secondary efficacy variables are those that support th 
validity of the primary variables but are less critical in deciding if this product Is effe< 
It is helpful, but not necessary, that statistical evidence of efficacy be shown for 
secondary efficacy variables. 

c. Prespecification of statistical analysis 

Statistical analysis of clinical endpoints is part of the process for obtaining consistei 
and convincing evidence of product efficacy. These statistical analyses should not \ 
data driven. This Is Implemented by identifying, in each study protocol, before data 
available for analysis, a sufficient description of the statistical analyses of primary 
efficacy variables so that an independent statistician could perform the protocol 
analyses. A brief description of the statistical analyses should Include but not 
necessarily be limited to specifying: (1) the level of significance to be used; (2) whe 
statistical tests of hypothesis or confidence intervals win be 1 or 2 sided; (3) whethe 
Interim analyses are planned and, if so, how the tests of hypotheses and confideno 
intervals will be adjusted to account for interim looks at the data; (4) the mathematk 
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^eSod uSforcmSng typ4 1 error rates for multiple primary efficacy 
SS^^^uJ* maWn 8 multiple treatment comparisons. 

d. Multiple endpobits 

There has often been a clinical argument for using multiple endpoints to assess prit 
Sce of^effeSess in RA. The theoretical bases for such combination endpo 
ar^e?ot1& 

measures recommended in FDA's previous guidance (FDA 1988). Wal results were 
3eS mw* a conclusion of effectiveness when statistical evidence o f «ffl 
STSSn for aTleast three of the four measures: physician global assessment, pa 
global assessment, swollen Joint count, and painful joint count. 

Multivariate statistical methods are also available for analyzing the set oTP^'y 
efficacVvSaSes. Procedures are being developed for inferences denved from mul 
endpoint results. 

Efficacy variables can be combined within patients (composite endpoint). Such a fi> 
corSS Z Tfficacy measures should be well defined in the study protocol 
STpostte efficacy variables have the advantage of avoiding several statical and 
inferential difficulties associated with multiple endpoints. 

e. Dropouts 

Dropouts are patients who. after a certain period of time in a trial fail to provide icHn 
K data scheduled to be collected by the protocol. Frequently, dropouts occur 
S related to taking the assigned test drug (adverse effects or ack of efficacy) 
sIScTdropoTte do not usually ocolir randomly, the remaining patients constitute a 
5£d Sample of the patients originally randomized Unless trial *opoute «n b 
assumed to be random (which Is rare), no model used to impute the* effects can bt 
verified. 

Methods used to handle dropouts, such asLOCF and <»w!&V?%^™™* 
satisfactory even though they have often served as the basis for determining that 

does not preserve the size of the test, either for the comparison of fir«l observation 

for the comparison of rates of change. Alternative methods 

analysis and random effects regression. These are also susceptible to 

Sniring C that is. dropping out depends on the value of ^ responseJt is often i 

to show that the results hold for a variety of analyses (i.e.. they are robust). 

f. Trials with several treatment groups/multiple comparisons 

in clinical trials involving more than two treatment groups, a stofisflcal mumple 
comparison procedure controlling the experiment-wise error rate to 5 percent or les 
should be applied. In essence, there should be overall statistical evidence of a 
treatment main effect before attempting to make specific drug compansons relevan 
proposed drug labeling. 

g. Trials simultaneously used to pursue more than one daim 

A single trial can be used to pursue simultaneously more than one claim; an adjusto 
of sionrficance level for multiple analyses is not always necessary. If the order of tei 
the hypotheses is prespecified. then no penalty need be taken. For example, when 
trial is simultaneously pursuing a six-month signs and symptoms claim and a twelyt 
month x-ray claim, if the trial wins by the first hypothesis tested C signs and sympto 
C then the x-ray hypothesis can be calculated without an adjustment penalty. 



h. Interim analyses 
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Interim analyses are those that, for any purpose, are repeatedly nc 
Cumulating clinical trial efficacy data. Methods have been developed fcr compens 
Srthe feet that multiple tests (Including Interim analyses alter the true sentence 
tevel The study protocol should state whether such Interim anises are planned o 
Should interim analyses be planned, the plan and its Implemented* should be 
described In the protocol. The description should include who will have access to th 
interim data, the scheduling of these Interim analyses, the method to be applied for 
adjusting significance levels, and the corresponding time sequence of significance 
levels at which statistically significant results will be claimed. 

Although an interim analysis may not be thought to affect the subsequent collection 
efficacy data, Interim analyses carry the additional risk that the blinding or conduct i 
study may have been compromised. Statistical methods cannot compensate for an; 
unwinding and bias that may result from gathering the information needed to perfor 
Interim analysis. Finally, if any major protocol change becomes necess^y (e.g., a r 
therapy becomes available), it is important that such a change not be influenced by 
those unbllnded to data. 

Sample size 

Failure to recruit an adequate number of patients is a major reason why an effective 
product may fail to meet established statistical criteria for efficacy, Independent of 
whether the purpose was to show superiority or comparability of treatment effect. T 
method for determining the sample size should be stipulated in sufficient detail to p. 
independent verification of the computation. This should include Identifying the effic 
variable that the sample size determination is based on, the magnitude of the 
hypothesized clinical difference, the standard deviation, the power, the significance 
level, and the sidedness of the statistical procedure(s) described In the analysis pla 
Furthermore, the size of the clinical difference chosen should be justified, and the 
rationale for the choice of the efficacy variable used to determine sample size shou 
discussed. For comparability from one trial to the next, it is optimal to use the same 
efficacy variables as were used to power earlier studies. 



j. Trials to show clinical equivalence 

The words clinical egu/Va/ence are used in a much narrower sense than these wore 
might imply to the casual reader. First, there Is often no intent to show equivalence 
two or more drugs across the broad spectrum of pharmacologic effect. Rather, focu 
on showing no clinically relevant differences for one or possibly more variables that 
to be dearly identified in advance. The concept of equivalence is two-sided in that i* 
any outcome measure, one drug is sufficiently different from another drug, then the 
drugs are no longer deemed equivalent in that variable. 

To show equivalence, the variables serving to measure these effects of interest she 
be defined in the protocol. For each efficacy variable for which clinical equivalence 
effect is sought, the magnitude of a difference deemed to be inconsequential shouh 
identified. The clinical data should then show, with 95 percent confidence, that this 
predefined difference is not exceeded. 

Inference based on trials to show equivalence is Inherently less convincing than 
inference based on trials to show the existence of a difference. Often, clinical trials 
not detect treatment differences that are known to exist. In such cases, statistical 
methods may then seemingly provide evidence of equivalent effect (e.g., to placeb< 



k. Appropriateness of the statistical methodology 

The appropriateness of the statistical model should be assessed, including checkin 
outliers and determining if distributional assumptions (usually normality) are met an 
common variance assumptions hold 



I. Site effects 
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If the patients have been stratified and randomized by site the «^ Jj^j* 
site effect There may be a site-by-treatment interaction reflecting the degree to whi 
Se £££ varKoss sites.™* is often notable when there is a ^atvariat., 
enrolled patients across sites. Site-by-treatment interaction should be explored. 

F. Safety Analysis 

The approach to evaluating adverse event data and laboratory values has traditionally differed fr< 
KSSto Evaluate efficacy. The purree of safety evaluations Is usually not to test a specifi^ 
hvSis buTto ermine the pattern effects and to detect unusual or delayed events. Analys 

She? Stales may be helpful. The safety profile should address to wha exterrtadyerce event* 
?dnjiSadffsTteb^ 

HittoK or pSsTble drug interactions. Incidence rates should be calculated using denominator 
S reflect the period of drug exposure for the population at risk. Cumulative lncrfences(hamrd 
rateTin^r P KilWes) do a better job representing the temporal pattern of drug effects than « 
Seroe raS; comparative cumulative incidence tables C drug versus active control s). versu 
pHSS!cS!it very helpful to practitioners. Critical incidence rates should be descnbed with 
percent confidence intervals. 

Ensuring safety during the development of a drug, biologic, or device can be optimized by adequ 
pSSa? evaluations 9 and the development of a standardized clinical safety a^ = t system 
Elements of a successful safety assessment system Include the use of predefined standard 
tenrSnology (such as the Medfatf Dictionary for Regulatory Activitfei ; Terminology)™* cnteria tc 
oSe aTassess adverse events (AEs). approaches to optimize ,AE detertion^nc; ^appropriate 
safety stopping rules in trials. It Is also useful to capture AE severity (grades 1 [m Id}, 2 [moderate 
fseverel or 4 nife-threatening]). outcomes (such as the need for therapy and whether resolutory 
SXociirredrtreatlng physfcian assessment of association with study agent (remotely possib 
probably related), and Impact on the trial (none, dose of agent delayed or changed, or patient 
SSSawn from further therapy). Stopping rules, detained by the risk/benefl ^ for the agent 
the study population, are desirable both for Individual patients (a single grade 3-4 AE is often use 
as well as for the clinical trial, especially in dose-escalation studies. 

1 . Intrinsic trial design considerations 

An attempt should be made to characterize the patient population susceptible to adverse < 
effects Some extraneous factors, such as variations in soliciting and reporting adverse ev 
among the investigators and differences in the definition of normal range* for lab values 
among different laboratories can complicate the safety data. Since adjustment for their eff< 
may be difficult, precautions should be taken in the design stage of the trial to minimize tin 
influence of these factors by preparing clear and specific instructions for data collection an 
monitoring adherence of the investigators and the laboratories to the protocol. Procedures 
normalizing laboratory data, for example, may be employed. As previously mentioned, 
developing standardized toxicity grading scales that can be employed in all studies may al 
be useful. 

2. Adequate numbers 

The ability to detect adverse experiences is dependent on the number of patients evaluate 
the clinical trials and in clinical usage. In studies of 300 or more patients having adequate 
exposure to the investigational drug, it Is expected (with 95 percent confidence) that at lea 
one patient will manifest an adverse event having an incidence rate of 1 percent or greatei 
Smaller studies fail to meet even this minimal incidence detection standard. In most cases 
however, it is possible to combine studies of equal duration to establish adverse experien( 
rates. 

For any chronically administered product, the safety database should include at least 300 
patients treated with the maximally recommended dose for at least six months and at leas 
patients treated for at least twelve months (ICH E1 A 1 995). Larger and/or longer safety 
databases may be advisable for agents with known or potential safety problems. 
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IV. SPECIAL CONSIDERATIONS FOR BIOLOGICAL PRODUCTS 33 

Although mere are similarities between RA trial designs for drugs and biologies, biologies have special 
characteristics and problems that should be considered in their development. 

A. Species Specificity 

Tlie schemes used traditionally in determining the initial human dose may not pertain to biologies 
Biological agents may behave differently in animal models than in humans, depending on the 
physiologic relevance and avidity for the receptor of the ligand in the animal compared to the hun 
Immunogenic^ may also be species specific. 

B. Dose Responses 

. The dose-response curve may be steep and/br even hyperbolic, and an agent can be quite toxic 
levels just above those thought to show efficacy. 

C. Toxicity Response 

The toxicity response may be highly unpredictable and potentially very dangerous and may indui 
the risk of disease worsening. Agents may have narrow therapeutic windows. Biologies have the 
potential for disruption of immunologic and physlotogic processes. Monoclonal antibodies to cellu 
epitopes of the immune system, for example, or to TNF receptors may cause serious morbidity a 
doses only slightly higher than those that are efficacious with markedly less toxicity. 

D. Product Homogeneity 

This often plays a critical roie in activity and toxicity of a compound. Product alterations can greai 
affect physiologic activity. Thus, biologies should demonstrate lot-to-lot consistency to the extent 
possible while under development and be reasonably well characterized to be properly evaluatec 

E. The Role of Antibodies ...... 

If phase 2 data suggest that agent-Induced neutralizing antibodies could interfere with the efficae 
a biological agent over time, it may become necessary to formally investigate the possibility in a 
randornized-controlled setting. The occurrence of neutralizing antibodies may call for the 
reconsideration of doses and dose regimens. Non-neutralizing antibodies may have a profound e 
on PK and may therefore be just as important as neutralizing antibodies. 

Table of Contents 



V. SPECIAL CONSIDERATIONS FOR MEDICAL DEVICES 
A. Background 

Medical devices used in the treatment of RA vary considerably in their therapeutic impact, rangin 
from primary therapy for a specific affected joint to products used as adjuncts to primary therapie 
such as the use of ultrasound or heat for symptomatic relief in conjunction with pharmaceutical 
therapy. Variability in therapeutic effects due to disease and response heterogeneity may be 
exaggerated with some devices because treatments are often targeted to a single or a few joints 
patient 

Preclinical requirements for devices include assessment of data from a diverse range of tests, 
Including, for example, tests that evaluate chemical composition, mechanical reliability or the 
electrical properties of a given device. Each product type has an identified battery of tests that an 
evaluated to ensure that the design of the specific product is well understood and can be expect* 
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have the intended effect on patients as described In its labeling. 

The design of clinical trials in RA for a device may raise some unique issues rannm I J 
h! m«*£ rttaii Mate The interaction between treated and untreated oints as it affects functional 

Sr idhn dffia* questions when the active therapy understudy is an Implant and the availabl 
SsTe more sySlc in activity. Understanding how to design these trials, analyze the date 
and assess the contribution from each therapy can be challenging. 

B. Efficacy Considerations 

1 The selection of a control group may be quite challenging with devices because masking f 
* £ rSSd nvesOgator is frequently impossible and a placebo may be in feasible. Be* 

tSSSTJSSSU often unsatisfactory and evaluating the equivalence of local therapy 
Atomic active control poses problems, innovative trial designs are often needed. Some 
Sfiveoptlons include randomization to early versus tete ^interventions, or resci 
Uerventions following failure of pharmaceutical interventions. Mtoughtoe use .of a sham 
Santo) device is the most desirable control for many P^™*^*^**^ 
Epropriate If the subject device is implanted. Additionally, the success of patent and/or 
ph$fen masking with such shams may not always be complete. Patient masking is infec 
If the product requires a surgical or other invasive procedure. However. In some cases it is 
possible to achieve evaluator masking for the effectiveness evaluation. 

2 For devices intended for use as adjunctive therapy, approaches and analyste methods she 
be designed to account for differences in disease status and severity to minimize potential 
biases in outcome measures. One approach is to haye a ^^^f^^^J. 
named pharmaceutical to avoid confounding the analyses. The use of additional. potenMal 
confounding co-therapies (hot/cold treatments, splinting, physical therapy, and orthotics) a 
needs to be appropriately addressed during the study design phase. 

3 The use of quality of life (QOL) assessments is very important where devices are intendec 
rehabilitative purposes. QOL benefits for the intervention should be judged with tools chos 
for their validity, ease, and convenience of administration and the ability to address both 
improvement in QOL and product satisfaction. These assessments should be kept maskei 
from the independent effectiveness evaluator to avoid assessment bias. 

4 in the case of invasive devices necessitating in-hospital or in-office use.lt is recommendej 
* that dinical assessments include convenience in use and pain or discomfort in administrat 

Early assessment of this aspect of therapy can provide critical Information on the ultimate 
acceptability of the treatment and affect decisions as to whether to pursue a treatment 
modality. 

C. Safety Considerations 

1 Obtaining well-characterized, short-term adverse events rates, as an early assessment of 
' safety as Is common for large pharmaceutical trials is not always meaningful In medical d€ 

trials Device trials are frequently much smaller in size and the reliability of early data may 
be sufficient to establish a prospective pattern. When the device in question is an Implant, 
evaluation period must be fairly long to assesses the potential for late occurring adverse 
events Including device failure and adverse reactions to device materials. In some cases. 1 
full assessment of safety cannot be completed in the premarket phase and extends into 
mandatory postmarket surveillance. 

2 When devices are used in conjunction with another medical or surgical procedure, the 
distinction between an adverse event that is dearly device related and one that is commof 
the procedure may be difficult to assess. The nature, timing, and degree of severity are so 
of the factors that may be useful in reaching a determination of cause. Since these 
determinations are based on dinical judgment, a real potential for bias exists In reaching 
condusions about the treatment Care must be exercised In designing the protocol to prov 
adequate detail In Instruction to Investigators to allow for consistent and unbiased decisior 
making on their part 
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3 Devices (e.g..moseem^^^^ 

^.^^fiMment as to whether there is a fetime exposure limit or a maximum trequen. 
ZSSXSSm m TfSently addressed with animal data as well as with long-terr 

clinical evaluations. 



Jahle of Contents 



VI. SPECIAL CONSIDERATIONS FOR JUVENILE RHEUMATOID ARTHRITIS 
A. Background 

^^teZrim HLA-B27 4 and they have been furthersubdivided based on clinical courses (Ca 
S?mi?n^«c subsS aS 'correlate with these clinical course subsets and are als 
ScSSmU^^^^ various entities, polyarticular JRA is similar In ma, 
aS to^rr^paSlariy In clinical signs and symptoms, synovitfe and s.m.lar efficacy 
r^rSes to someexisting pharmacotherapy (NSAIDs. methotrexate and predn«one). As only 
KSo? aSerSSSi rheumatoid arthritis develop illnessonset during childhood, many 

SmCSm. in this small VO^^r^^J^JlSSSSZ 

tn ? OFR [Dart 316C Orphan Drugs. The application of principles In the conduct of chnical tnals i 
adul wSSSi S«S£2 SSoV and L section outlines only those ™ rfMnro fr 
ra Soonsors Tare aenerally encouraged to develop as much information as posslbte on JR 

S^^^^P^ for adU " ^ M 6 minimUm • d ° Sin9 ^ 
strongly encouraged. 

Conducting drug studies In children is generally necessary and consfetent with |>» mpe^ons c 
^^^nimAnft for this disease Because pediatric subjects constitute a vulnerable populatio 

StfSytf p£S£\ Ss p?b«shed guidelines for the ethical conduct of studies to evaluate dru 
rSoS owulaS (AAP 1995a), and general considerations for the clinical evaluation o drugs 
■S£K5l^ Guidelines reg^rdmg Inform, 

coSandaS 

SfSSC sS also be followed (AAP 1 995b). Conducting clinical trials ta r patl MM 
and particularly, assessing global disease activity and response to therapy should wolve pedial 
SSSogSor adult rheumatologfcts who have extensive training h pediatric rheumatology t 
have demonstrated competence in caring for children with rheumatic diseases. 

As a aeneral principle, children should not be subjected to an agent that has not been first tested 
wfety SnesSg may begin in children, however, when the anticipated benefits based on 
StetinT Sedge justify the anticipated risks. An agent developed specially for use in JRA (e 
^SSS^SSS^V^ a specific pathogenic process that is unique to JRA and not pre 
in aduttRA?mav 'wed to be tested first in chBdren, as exposure in adu t RA patients sven i r»rr 
a^K Steers may be unrevealing. If. however, the agent has potential for use in both adult R/ 
^X^Sum.fK-PD and I Initial phase 1 data Onduding m>*j^<^ »"£ 
wtor to the start of testing in children. JRA trials of drugs that are expected to be similar in efflcac 
Engdmgsand that do not represent rrajor therapeutic advanws or alternative approaches tc 
Sic mechinism of intervention can be delayed until there is extensive efficacy and safety data 
either from adults or in other pediatric populations. 

The need for reliable Inferences does not necessitate a placebo control, but randomization and 
SSnTshouSbe employed. The choice of control is a function of what is known about the agen 
Mao and what other treatments are available to potential trial enrollees If only an active i conti 
used for an equivalence trial, convincing evidence of the efficacy of the active ™^ shoutajte 
provided, and the test proposed to establish equivalence should be specified. If there have been 
prior adult studies, or if the agent underdevelopment has a novel mechanism of action orrepres. 
an entirely new class of drug, a randomized, double-blinded trial using either a 
control group of (anticipated) similar efficacy Is indicated. Open label extensions to oMain additto 
data about risk and persistence of benefit are very valuable. The use of active control (standards 
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SweeSySonverrtional or alternative treatment The sponsor should .nd.cate how dropouts w. 
handled in the analysis, whether from the escape clause, or otherwise. 

B. Applicability of Pediatric Regulation and Impact on Trial Design for JRA Studies 

Th* oed/afrfc use section in the labeling regulations (21 CFR 201 .57(0(9)) permite drug and blolo 
^S^SmSS^pS^ use if they have been demonstrated to be safe and effective f 

Dediafrfc rule may be applied only to obtain labeling for the signs and symptoms of JRA; other dc 
LdidS DrwSn ofSiotural damage, remission, and prevention of disability, should be 

efficacy data, additional pediatric dosing and safety evaluations are usually needed. 

in aeneral sponsors seeking approval for adult RA products appropriate for use in patients with . 

Z SfaS^dtetric use sections of the label. Specimen ^f^^SSSSSS^ 
SmificarSy if available data indicate that the coefficients are simter maduHs ^ cMdra «. 
XSm^ing techniques should be employed for such studies (Hashimoto 1991. The e xtent of 
safe^tesfing needed depends on the agent, its prior use. and any estabtehed safety -hotter 
oedtetnTSafions. Toxicity grading scales should be adjusted for pediatric populations. Phase 

X Tdwirablei mat as much efficacy evidence as possible be gathered during the evaluation of 
pediatric dosing and safety. 

For currently approved traditional (cyclooxygenase inhibitor) NSAIDs and c^icosteroids, adequ? 

exists to support a labeled indication for all JRA and all JRA subsets For 
™SexatewdsuSazine, adequate efficacy information exists for a labeled mdwation for J 
^SSSXSSSS^ course. For such agents, a labeling claim could be supported I usmg c 
PK PD. and safety data in JRA patients, although submission of additional JRA efficacy data is 
encouraged. 

For new agents (not yet approved for adult RA) that are not from a new pharmacologic class. adi 
efficacy data can be usedto support a signs and symptoms dun for particular is 
Sogical plausibility that the ajent would have a similar effect in JRA The ' "Pf*^ * *» _ 
pediatric rile to support a labeled indication for polyarticular course JRA will be based hon adult R 
efficacy data consWered on an Individual basis for each agent. When evdence for b»tog«al 
SaSry does not exist, evidence should be submitted to support the application erf the pedlatri 

to apply the pediatric rule.) Pediatric safety and dosing studies should be submitted. 

For agents in a new dass, efficacy studies should be performed in JRA to obtain an indication lot 
in JRA. The Indication will reflect the JRA subsets Included in the efficacystudy Sponsors who s 
approval for all JRA should include all JRA subsets In an efficacy study. The data couid support . 
daim for JRA (subsets not specified) provided that the data do not suggest thatthe ^gent is 
toeffedive in any one subset The label should reflect that efficacy was dernonsfrated and that th. 
agent is approved for JRA (subsets specified depending on which were included m the efficacy 
study). 

C. Outcome Variables and Claims 

It is possible for sponsors to seek approval for all JRA subsets or to seek approval for Individual 
subsets. In the former case, the label should note the number of patients from each subset enroll 
trials and the character of each subset response. Except as noted above in the application of the 
pediatric rule, all claims should be supported by an efficacy demonstration In the intended subsel 

1. Reduction in the Signs and Symptoms of JRA 
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All J RA trials should evaluate Improvement based on a validated CTd P dn ^[j^*J?J 
OmX^on^MM approach is the definition of improvement established I by jhe , 
eSUS UveVof sb? (MD global, parent/patient global, number of active ointe number of 
SiiSSSS!SSS£. functional ability, and ESR) improved lyMJMi 
nommftan one of six worsening by more than 30 percent (Glannlnl 1997). Because the 
SrfSSroveinent was validated using a trial of methotrexate, whk* primarily indu 
tSSSSmSS^SSSL. protocol indrvlduafcation may necessitate a refinement n the 
KnSr test for other patent subsets. For example for V^^^^^l 
tavorved and a normal ESR. use of joint and functional assessments W^*^***** 
fSSniwlSti if uve Us as coprimaiy endpotais may also be valuable (Undsley 1 9€ 
F^^th systemic o^ 

manifestations, and thrombocytosis/leucocytosis may be useful «>P nr "f2^tt£ ^ 
ffiUverman 1994) Outcome variables need to be appropriate and consistent with the type 
SnTStoi ^KSaSvestigators should specify, before the trial is Initiated, how mu 
change is considered clinically important for each outcome variable. 

Trials should generally last at least six months, except when slx^nth efficacy data exist 
adult RA and there are no reasons to expect loss of efficacy over time. Under these 
circumstances trial durations may be three months' blinded/randomized, but six-months' c 
sa7e^SuWbe Stained. As with adult RA. a three-month trial duration is suggestet 
NSAIDs. 

2. Major Clinical Response 

Similar to adult RA. major clinical response is a claim intended to connote that the agent 
provides substantial clinical benefit. Including In patients who are unable to completely res 
to the treatment or remit from the disease. At present, this claim is only theoretical, as dim 
JRA trial databases adequate for defining major clinical response do not exist. 

3. Complete Clinical Response 

The claim of complete clinical response reflects achievement on drug of six consecutive 
months of morning stiffness of less than 1 5 minutes duration, no active synovitis (pain, 
redness tenderness to palpation, swelling, stable or decreasing limitation of motion), no a 
articular features (including fever, serositis. adenopamy. hepatosplenomegaly. ra^.^ert, 
and normal laboratory parameters (including ESR. platelets. WBC) and. where applicable, 
ongoing structural damage while continuing on therapy. Trials should be at least one year 
duration Residual damage from prior disease, Including extra articular manifestations, is 
acceptable in meeting criteria for complete clinical response. Because spontaneous comp 
clinical response rates may be relatively high in JRA. these studies should be controlled. 

4. Remission 

Remission is characterized exactly as above, but while off all antirheumatic drugs. Remiss 
is not intended to imply cure. 

5. Prevention of Disability 

This claim is proposed to reflect durable improvement in physical function and disability in 
studies of one to two years' duration with demonstrated improvement In signs and symptoi 
over the same period. Instruments currently validated for use in JRA include the ChfloTioot 
Health Assessment Questionnaire (CHAQ). the Juvenile Arthritis Self-Report Index (JASI) 
(he Juvenile Arthritis Functional Assessment Report (JAFAR). HR-QOL should also be 
measured and demonstrated not to worsen over the trial duration. Endpoints should be ta 
to subtypes enrolled in trials (e.g., to assess knee function in paud-articular JRA patents l 
whom knee arthritis may be the primary arthritic manifestation). Instiuments should be 
developmental^ validated for the age ranges studied in a trial (Murray 1995). 

6. Prevention of Structural Damage 

Similar to adult RA, this claim would reflect trials of one year or more with concomitant sue 
in signs and symptoms. Currently, only sparse data exist regarding the usefulness of only 
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radloaraphic measure In JRA: the carpal-metacarpal distance In those pati ente w"h *J* 
S3T 8Sd *■* promising settings Include the evaluation of erosjvedteease j, 
™3cewlth polyarthritis, hip assessment in systemlcs. and knee assessments in pauct 
articular JRA. 

D. Trial Design Issues 

Recommendations for efficacy studies are based on the nature of the agent under development 
KoteTXed I for adult RA are generally applicable. Patients enrolled .n these trials may be c 
5» t rZ a course subset Separate trials for each JRA subset are recommended if the 
SSSS^^!SSiS^ of action that will not be ^applicable and equally 
eEdoS % atlJRA subsets. Alternatively, a single, sufficiently large trial wife enroUment 

stratified provides for useful conclusions to be reached about efficacy and safety to 
SS3 SSS Striates include disease course type, disease duration, and nonrespom 
prtor SotreSfe SatmenL Given that JRA Is an orphan disease, there is often some flexibility 
trial design, but this should be discussed on a case-by-case basis. 

At this time JRA patients are usually ineligible for entry into efficacy trials unless they have failed 
r^d%uatety to at least one standard second line agent (such as nrrcthotoexate at a dose « 

SdeLTa^rS efficacy could be obtained by using the agent very eariy in the disease cou 
el dence that delayed use In sicker patients potentially carries greater risk o toxurty. or evidence 
me aaent has a favorable safety and efficacy profile in a comparable population sludted to date a 
Sat ffaSf actions are potentially readily reversible. Pauci-artteular JRA patients are , parUcul 
eTcouragidfo^ncluslonin^ 

?SaMSsting therapy with an Improved safely profile, less frequent blood monitoring, and/or 
superior efficacy. 

Some JRA patients exposed to new agents should be evaluated periodically 'for an extended peri 
Effecteonsteletal growth, development, behavior, sexual maturation, reproductive capacity, and 
seSa!y malJnaScy should be included in such monitoring. Registries or cohort follow-up stud 
may be useful in providing long-term safety information. 

E. Concurrent Antirheumatic Agent Administration 

"The orinciples of use of concurrent antirheumatic therapy in JRA trials are similar-to those outline 
adult RA Citing ttieir discretionary use as much as reasonably possible so tha interpretation^ 
effic^arrisafety data is not compromised. However, limitations on concurrent medication canr 
2 iSly^JSedlreatmentiTnor should the protocol be made so unattractive to parents 
KfcSaTd^tients that enrollment is threatened. If background treatmert : te recessary, earl 
tolerance studies, to ensure safety of co-administration, should precede any large tnals. 

If patients receive concurrent slow-acting or prednisone therapy the dose should be ?^le prior • 
stS entoy and should preferably remain so throughout the trial. Concurrent medications are usu 
frrSSro^ost^ so may need stratification. If possible. Intra-artlcular steroid Injection 
should be disallowed for a minimum of one month prior to beginning expenmental therapy; othen 
that joint should be discounted in assessing therapeutic effects. 

F. Multicenter Trials and Center Effects 

Although JRA is the most common rheumatic disease of childhood, its prevalence is ; low compan 
adult RA Thus, trials of JRA that require large numbers of patients will likely be multicenter tnals 
Multicenter trials should employ a standardized protocol and data collection forms among a I can- 
Pretrial meetings of all investigators and other involved personnel are strongly encouraged to en. 
urtformity in protocol Interpretation, patient evaluation, and data recording. Studies have shown t 
IS cooperative group, a centers performance Is a function of the number of patients enrolle 
the center (Sylvester 1981). Thus, studies that use fewer centers with greater numbers of patient 
each center are preferable to those that use large numbers of centers with fewer patients. Effort 
should be made to enroll at least 10 to 12 patients at each center to provide for greater quality 
assurance. In all multicenter trials, center effects should be examined. A therapy should show eff 
In more than one center. When stringent entrance criteria restrict the number of patients eligible 1 
study many centers may be unable to enroll even 1 0 patients. In such situations, randomization 
blocked within Individual centers, rather than across all centers, may help to reduce the potential 
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APPENDIX A: COMPARATIVE TRIAL RESPONSE RATES 5 



Three cooperative systematic studies of rheumatic diseases (CSSRD) trials: 
(1) Methotrexate vs. Placebo 
(2) Gold, Auranofln vs. Placebo 
(3) D-Penicillamine high, low vs. Placebo 

Response Rates at End of Trial Based on Different Definitions of Improvement 
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PLACEBO 


(Auranofin) 
(Low-dose D- 

rciiivillal i m wsj 


High-dose 
D-Penlcillamine 
Gold Methotrexate 


Definition ot 
Improvement 




n=18 


n=155 


ACR5* 20% 


10(8.4%) 


30 (25.4%) 


64(40.3%) 


ACR£ 30% 


.5(4.2%) 


14 (12.0%) 


46 (29.7%) 


ACR2* 40% 


2(1.7%) 


7(3.4%) 


18(11.6%) 


ACRS* 50% 


0(0%) 


4(3.4%) 


14(9.0%) 


ACR£ 60% 


0 (0%) 


3 (2.5%) 


4 (2.6%) 


ACR> 70% 


0 (0%) 


0(0%) 


1 (0.6%) 
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APPENDIX A (cont.) 

COMPARATIVE MULTICENTER TRIAL OF 
AURANOFIN/METHOTREXATE 
(END OF TRIAL) 

Response Rates at End of Trial Based on Different Definitions of Improvement 



Definition of 


Auranofin 


Methotrexate 


Improvement 


(n=118) 


(n=119) 


ACR> 20% 


34(28,8%) 


77(64.7%) 


ACR2> 30% 


30 (25.4%) 


65(54,6%) 


ACRS* 40% 


22 (18.6%) 


51 (42.9%) 


ACR2* 50% 


21 (17.8%) 


42 (35.3%) 


ACR2* 60% 


9 (7.6%) 


22(18.5%) 


ACR> 70% 


7(5.9%) 


11(9.2%) 
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APPENDIX A (cont.) 

COMPARATIVE TRIAL OF CYCLOSPOR1NE-A 
METHOTREXATE VS. METHOTREXATE ALONE 

Response Rates at End of Trial Based on Different Definitions of Improvement 

Percent Increase bv ACR Patients Satisfying Criteria 

Criteria 
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Methotrexate* 


Methotrexate* 




Cycfosporine-A 




(n=71) 


(n=74) 


0% 


81 J 


50.0 


10% 


49.3 


16.2 


20% 


45.0 


12.2 


30% 


33.8 


8.1 






2.7 


50% 


22.5 


2.7 


60% 


5.6 


2.7 


70% 




0.0 


80% 


0.0 


0.0 


90% 


0.0 


0.0 



A This guidance has been prepared by the Rheumatology Working Group of the Medical Policy Coordinating Committee (MPCQof the Center for 
Evaluation and Research (ODER), the Center for Biologies Evaluation and Research (CBER). end the Center for Devices and Radiological Health 
(CDRH). 



2 The ACR definition of Improvement (ACR 20) is 20 percent Improvement in tender and swollen joint counts and 20 percent Improvement in three 
five remaining core set measures: patient and physician globals, pain, disability, and an acute phase reactant (Felson 1993, 1995). For specific de 
on individual measures, e.g., disability, acute phase reactant refer to Felson 1995. 



(CH documents are available via the FDA internet home page at http:Vwww.rda.gov/cder or cber. 



* The HLA-B27 subset Is not addressed in this document. 



5 CSSRD database and Tugwell et al., "Combination Therapy with Cyclosporins and Methotrexate in Severe Rheumatoid Arthritis,*' N Engl J Mad 
333:137-141. 1995. 
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